generalized framework
Concept Activation Regions: A Generalized Framework For Concept-Based Explanations
Concept-based explanations permit to understand the predictions of a deep neural network (DNN) through the lens of concepts specified by users. Existing methods assume that the examples illustrating a concept are mapped in a fixed direction of the DNN's latent space. When this holds true, the concept can be represented by a concept activation vector (CAV) pointing in that direction. In this work, we propose to relax this assumption by allowing concept examples to be scattered across different clusters in the DNN's latent space. Each concept is then represented by a region of the DNN's latent space that includes these clusters and that we call concept activation region (CAR). To formalize this idea, we introduce an extension of the CAV formalism that is based on the kernel trick and support vector classifiers.
Concept Activation Regions: A Generalized Framework For Concept-Based Explanations
Concept-based explanations permit to understand the predictions of a deep neural network (DNN) through the lens of concepts specified by users. Existing methods assume that the examples illustrating a concept are mapped in a fixed direction of the DNN's latent space. When this holds true, the concept can be represented by a concept activation vector (CAV) pointing in that direction. In this work, we propose to relax this assumption by allowing concept examples to be scattered across different clusters in the DNN's latent space. Each concept is then represented by a region of the DNN's latent space that includes these clusters and that we call concept activation region (CAR).
Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis
Nair, Nithin Gopalakrishnan, Cherian, Anoop, Lohit, Suhas, Wang, Ye, Koike-Akino, Toshiaki, Patel, Vishal M., Marks, Tim K.
Conditional generative models typically demand large annotated training sets to achieve high-quality synthesis. As a result, there has been significant interest in designing models that perform plug-and-play generation, i.e., to use a predefined or pretrained model, which is not explicitly trained on the generative task, to guide the generative process (e.g., using language). However, such guidance is typically useful only towards synthesizing high-level semantics rather than editing fine-grained details as in image-to-image translation tasks. To this end, and capitalizing on the powerful fine-grained generative control offered by the recent diffusion-based generative models, we introduce Steered Diffusion, a generalized framework for photorealistic zero-shot conditional image generation using a diffusion model trained for unconditional generation. The key idea is to steer the image generation of the diffusion model at inference time via designing a loss using a pre-trained inverse model that characterizes the conditional task. This loss modulates the sampling trajectory of the diffusion process. Our framework allows for easy incorporation of multiple conditions during inference. We present experiments using steered diffusion on several tasks including inpainting, colorization, text-guided semantic editing, and image super-resolution. Our results demonstrate clear qualitative and quantitative improvements over state-of-the-art diffusion-based plug-and-play models while adding negligible additional computational cost.
On a Generalized Framework for Time-Aware Knowledge Graphs
Krause, Franz, Weller, Tobias, Paulheim, Heiko
Knowledge graphs have emerged as an effective tool for managing and standardizing semistructured domain knowledge in a human- and machine-interpretable way. In terms of graph-based domain applications, such as embeddings and graph neural networks, current research is increasingly taking into account the time-related evolution of the information encoded within a graph. Algorithms and models for stationary and static knowledge graphs are extended to make them accessible for time-aware domains, where time-awareness can be interpreted in different ways. In particular, a distinction needs to be made between the validity period and the traceability of facts as objectives of time-related knowledge graph extensions. In this context, terms and definitions such as dynamic and temporal are often used inconsistently or interchangeably in the literature. Therefore, with this paper we aim to provide a short but well-defined overview of time-aware knowledge graph extensions and thus faciliate future research in this field as well.
- Europe > United Kingdom (0.46)
- Europe > Germany (0.04)
A Generalized Framework for Microstructural Optimization using Neural Networks
Sridhara, Saketh, Chandrasekhar, Aaditya, Suresh, Krishnan
Microstructures, i.e., architected materials, are designed today, typically, by maximizing an objective, such as bulk modulus, subject to a volume constraint. However, in many applications, it is often more appropriate to impose constraints on other physical quantities of interest. In this paper, we consider such generalized microstructural optimization problems where any of the microstructural quantities, namely, bulk, shear, Poisson ratio, or volume, can serve as the objective, while the remaining can serve as constraints. In particular, we propose here a neural-network (NN) framework to solve such problems. The framework relies on the classic density formulation of microstructural optimization, but the density field is represented through the NN's weights and biases. The main characteristics of the proposed NN framework are: (1) it supports automatic differentiation, eliminating the need for manual sensitivity derivations, (2) smoothing filters are not required due to implicit filtering, (3) the framework can be easily extended to multiple-materials, and (4) a high-resolution microstructural topology can be recovered through a simple post-processing step. The framework is illustrated through a variety of microstructural optimization problems.
Sampling, Intervention, Prediction, Aggregation: A Generalized Framework for Model Agnostic Interpretations
Scholbeck, Christian A., Molnar, Christoph, Heumann, Christian, Bischl, Bernd, Casalicchio, Giuseppe
Non-linear machine learning models often trade off a great predictive performance for a lack of interpretability. However, model agnostic interpretation techniques now allow us to estimate the effect and importance of features for any predictive model. Different notations and terminology have complicated their understanding and how they are related. A unified view on these methods has been missing. We present the generalized SIPA (Sampling, Intervention, Prediction, Aggregation) framework of work stages for model agnostic interpretation techniques and demonstrate how several prominent methods for feature effects can be embedded into the proposed framework. We also formally introduce pre-existing marginal effects to describe feature effects for black box models. Furthermore, we extend the framework to feature importance computations by pointing out how variance-based and performance-based importance measures are based on the same work stages. The generalized framework may serve as a guideline to conduct model agnostic interpretations in machine learning.
- Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
- North America > United States > New York > New York County > New York City (0.04)
- Asia > Middle East > Jordan (0.04)
A Generalized Framework for Population Based Training
Li, Ang, Spyra, Ola, Perel, Sagi, Dalibard, Valentin, Jaderberg, Max, Gu, Chenjie, Budden, David, Harley, Tim, Gupta, Pramod
Previous PBT implementations have been synchronized glass-box systems. We propose a general, black-box PBT framework that distributes many asynchronous "trials" (a small number of training steps with warm-starting) across a cluster, coordinated by the PBT controller. The black-box design does not make assumptions on model architectures, loss functions Figure 1: Black-box Service for Population Based Training or training procedures. Our system supports dynamic hyperparameter based on a Worker-Controller framework. Each solid blue schedules to optimize both differentiable and non-differentiable circle represents a training trial. A black arrow represents a metrics. We apply our system to train a state-of-the-art WaveNet trial dependency (usually for warm-starting the model from generative model for human voice synthesis. We show that our PBT a parent's checkpoint) and a gray arrow represents an unselected system achieves better accuracy and faster convergence compared parent trial which loses in a tournament and fails to existing methods, given the same computational resource.
- Oceania > Australia > New South Wales > Sydney (0.04)
- North America > United States > New York > New York County > New York City (0.04)
A Generalized Stochastic Variational Bayesian Hyperparameter Learning Framework for Sparse Spectrum Gaussian Process Regression
Hoang, Quang Minh, Hoang, Trong Nghia, Low, Kian Hsiang
While much research effort has been dedicated to scaling up sparse Gaussian process (GP) models based on inducing variables for big data, little attention is afforded to the other less explored class of low-rank GP approximations that exploit the sparse spectral representation of a GP kernel. This paper presents such an effort to advance the state of the art of sparse spectrum GP models to achieve competitive predictive performance for massive datasets. Our generalized framework of stochastic variational Bayesian sparse spectrum GP (sVBSSGP) models addresses their shortcomings by adopting a Bayesian treatment of the spectral frequencies to avoid overfitting, modeling these frequencies jointly in its variational distribution to enable their interaction a posteriori, and exploiting local data for boosting the predictive performance. However, such structural improvements result in a variational lower bound that is intractable to be optimized. To resolve this, we exploit a variational parameterization trick to make it amenable to stochastic optimization. Interestingly, the resulting stochastic gradient has a linearly decomposable structure that can be exploited to refine our stochastic optimization method to incur constant time per iteration while preserving its property of being an unbiased estimator of the exact gradient of the variational lower bound. Empirical evaluation on real-world datasets shows that sVBSSGP outperforms state-of-the-art stochastic implementations of sparse GP models.
- Transportation (1.00)
- Consumer Products & Services > Travel (0.67)
Transfer Learning Framework for Early Detection of Fatigue Using Non-invasive Surface Electromyogram Signals (SEMG)
Chattopadhyay, Rita (Arizona State University) | Ye, Jieping (Arizona State University) | Panchanathan, Sethuraman (Professor and Deputy Vice President of Research and Economic Affairs, School of Computing, Informatics, and Decision Systems Engineering, Computer Science and Engineering Faculty)
The fundamental assumption being, any hypothesis found to approximate well over a sufficiently large Surface Electromyogram (SEMG) signals are physiological set of training examples will also approximate well over signals processed to assess the intensity of activity and the other unobserved examples (Mitchell 1997), belonging to fatigue state of the muscles, non-invasively (Kumar, Pah, the same distribution as the training data. But if this basic and Bradley 2003; Georgakis, Stergioulas, and Giakas 2003; assumption is violated as in the case of SEMG data over Koumantakis et al. 2001; Gerdle, Larsson, and Karlsson multiple subjects, direct application of traditional data mining 2000). However researches observed significant difference and machine learning methods would not work. Figure 1 between the data collected from different subjects shows a typical distribution of SEMG data for three different though they performed the same activity under similar experimental subjects, collected over a fatiguing exercise at varying speed conditions (Contessa, Adam, and Luca 2009; representing the four physiological phases corresponding to Gerdle, Larsson, and Karlsson 2000). Because of their four classes (l) low intensity of activity and low fatigue, (2) highly subject specific nature the SEMG based fatigue assessment high intensity of activity and moderate fatigue, (3) low intensity requires subject specific calibration and are hence of activity and moderate fatigue and (4) high intensity confined to clinical environments related to training and rehabilitation. of activity and high fatigue.